Content Based Search Engine for Historical Calligraphy Images

نویسندگان

  • Xiafen Zhang
  • Vijayan Sugumaran
چکیده

Paper collections of historical calligraphy objects in Libraries and museums are scanned into document images to serve the academic society. However, these digitized collections are in image format, lacking the technology to search by image content. This paper proposes a search engine for searching calligraphy image content. First, 2503 page images are segmented into characters and components. Second, characters are interactively labeled and features are extracted to build a calligraphy database. When an image search query is submitted, coarse features are first extracted and used to prune the long list of calligraphy characters into a shorter list. Then fine shape features are employed to determine the most similar characters. iDistance and NB-Tree are used to create the high dimensional index. The efficiency of the algorithm has been demonstrated through experiments with 110,737 individual calligraphic character images. This research provides a demonstration of the potential use of calligraphy content search on the web. Content Based Search Engine for Historical Calligraphy Images

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Comparing between the impacts of text based indexing and folksonomy on ranking of images search via Google search engine

Background and Aim: The purpose of this study was to compare the impact of text based indexing and folksonomy in image retrieval via Google search engine. Methods: This study used experimental method. The sample is 30 images extracted from the book “Gray anatomy”. The research was carried out in 4 stages; in the first stage, images were uploaded to an “Instagram” account so the images are tagge...

متن کامل

مرور مؤثر نتایج جستجوی تصاویر با تلخیص بصری و متنوع از طریق خوشه‌بندی

With unprecedented growth in production of digital images and use of multimedia references, requirement of image and subject search has been increased. Systematic processing of this information is a basic prerequisite for effective analysis, organization and management of it. Likewise, large collections of images have been made available on the Web and many search engines have provided the poss...

متن کامل

Networks: Spring 2007 Keyword-based Advertising 1 Keyword-based Advertising

The problem of Web search, as traditionally formulated, has a very " pure " motivation: it seeks to take the content people produce on the Web and find the pages that are most relevant, useful, or authoritative for any given query. However, it soon became clear that a lucrative market existed alongside this for combining search with advertising, targeted to the queries that users were issuing. ...

متن کامل

1 Content - based Image Retrieval

This is a project report for the graduate capstone course of Search Engine Architecture at NYU. In this project, we implemented a basic image search engine based on mere image content. In other words, we use an image to search other similar images. We don’t take into consideration the text information related to images, but just the image content. As the past few years have seen amazingly succe...

متن کامل

Chinese Brush Calligraphy Character Retrieval and Learning

Chinese brush calligraphy is a valuable civilization legacy and a high art of scholarship. It is still popular in Chinese banners, newspaper mastheads, university names, and celebration gifts. There are Web sites that try to help people enjoy and learn Chinese calligraphy. However, there lacks advanced services such as content-based retrieval or vivid writing process simulation for calligraphy ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IJIIT

دوره 10  شماره 

صفحات  -

تاریخ انتشار 2014